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14.  ABSTRACT 

Prostate  cancer  is  a  heterogeneous  disease  characterized  by  diverse  outcomes.  A  subset  of  patients  originally 
diagnosed  with  low  risk  disease  go  on  to  be  diagnosed  with  high  grade  disease.  It  is  important  to  determine  to 
what  extent  this  is  due  to  undersampling  on  the  initial  biopsy,  and  to  what  extent  low  grade  disease  evolves.  In 
addition,  multiple  genetic  alterations  are  associated  with  disease  evolution  in  response  to  therapy.  This  project 
aims  to  characterize  evolution  of  prostate  cancer.  Completed  work  used  exome  sequencing  to  evaluate  the 
relationship  of  low  grade  foci,  high  grade  foci,  and  metastases  in  radical  prostatectomy  specimens.  This  work 
demonstrates  that  coincident  low  and  high  grade  disease  are  distantly  related,  indicating  that  an  early  parental 
clone  can  give  rise  to  both  low  grade  and  high  grade  disease.  Conversely,  lymph  node  metastases  are  closely 
related  to  high  grade  cancer.  Alterations  in  the  TP53  pathway  are  associated  with  high  grade  disease.  Aims  still 
under  investigation  are  evaluating  the  relationship  between  coincident  conventional  prostate  cancer  and  the 
aggressive  variant  ductal  adenocarcinoma,  and  using  circulating  tumor  cells  to  evaluate  the  evolution  of  castrate 
resistant  metastatic  cancer  from  primary  foci. 
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Introduction 


Prostate  cancer  is  a  heterogeneous  disease  characterized  by  diverse  outcomes.  A 
subset  of  patients  originally  diagnosed  with  low  risk  disease  are  later  diagnosed  with  high 
grade  disease.  Measures  taken  to  reduce  these  events  depend  on  to  what  extent  this  is 
due  to  undersampling  on  the  initial  biopsy,  and  to  what  extent  low  grade  disease  evolves. 
Whereas  it  is  unclear  if  low  grade,  indolent  cancer  evolves  into  high  grade,  aggressive 
disease,  select  genetic  alterations  are  associated  with  evolution  of  advanced  disease  in 
response  to  therapy.  This  project  aims  to  characterize  evolution  of  prostate  cancer  in 
both  localized  and  metastastic  disease.  One  aim  seeks  to  evaluate  evolution  of  localized 
disease  to  high  grade  disease  or  aggressive  variants.  The  second  aim  seeks  to 
characterize  evolution  that  occurs  in  the  progression  of  localized  prostate  cancer  to 
metastatic  disease. 

Keywords 

Cancer  genetics,  tumor  evolution,  tumor  heterogeneity,  prostate  cancer,  exome 
sequencing 

Overall  Project  Summary 

The  research  project  proposed  within  this  award  was  divided  into  two  specific  aims,  with 
sub  aims  further  delineated  within  each  aim  and  specific  tasks  outlined  within  the 
statement  of  work.  This  portion  of  the  report  will  be  organized  by  specific  aim  and 
according  to  the  tasks  outlined  in  the  SOW. 

Aim  1:  Investigate  the  molecular  relationship  between  coincident  prostate  cancer  foci. 
Aim  la:  Test  the  hypothesis  that  coincident  low  and  high  grade  foci  represent  evolution 
of  a  single  clone. 

Matched  low  and  high  grade  foci  from  prostatectomy  specimens  underwent 
exome  sequencing  to  an  average  depth  of  75  million  reads  (Table  1).  Five  specimens 
were  sequenced.  One  specimen  was  sequenced  twice  but  with  low  quality  data  each 
time,  and  so  data  from  four  specimens  were  used  for  evaluation.  Sequences  were  aligned 
and  301  potential  somatic  mutations  were  identified  using  VarScan  and  custom  scripts. 

A  portion  of  high  confidence  somatic  mutations  identified  through  VarScan  were 
confirmed  via  Sanger  sequencing  and  targeted  capture  and  resequencing  (Figure  1). 


Table  1.  Characteristics  of  sequencing  libraries 
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Fraction  covered  at  lOx 
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0.87 
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High  confidence  mutations 

11 

23 
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IS 

45 

19 

57 
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14 

Mutation  rate  per  Mb 

0.30 

0.63 

0.63 

0.41 

1.23 

0.52 

1.56 

1.64 

0.93 

0.38 

High  confidence  mutations  include  nonsynonymous,  synonymous,  UTR  (untranslated  region),  and  splice  junction  mutations,  and  introductions  of 


stop  codons.  Mutation  rate  is  based  on  36.5  Mb  of  exome  sequence  targeted  for  capture. 
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The  results  of  this  aim  demonstrate  that  low  grade  prostate  cancer  diverges  early 
from  high  grade  prostate  cancer  (Figure  2),  and  that  in  some  cases  we  were  not  able  to 
rule  out  that  low  and  high  grade  cancer  derived  from  separate  clones. 

We  also  found  that  mutations  in  the  p53  signaling  pathway  are  exclusive  to  high 
grade  foci,  suggesting  that  these  mutations  may  be  a  marker  of  high  risk  disease.  Results 
from  this  aim  are  now  published  and  the  manuscript  is  included  as  an  appendix  to  this 
report  (VanderWeele  et  al,  Cancer  Science,  2014).  This  work  fulfills  the  following 
sub  tasks  from  the  SOW:  sub  task  2.1,  sub  task  2.3,  sub  task  3.1. 

Aim  lb:  Determine  if  coincident  acinar  and  ductal 
adenocarcinoma  are  clonally  related. 

Prior  to  submitting  the  award  proposal,  I  had 
obtained  approval  for  IRB  protocol  10-270- A  to 
obtain  deidentified  prostate  tissue  specimens  for  use 
in  this  project.  Shortly  after  the  PRTA  was  awarded, 
this  protocol  was  amended  to  allow  use  of  samples 
from  the  PCBN.  The  PCBN  was  able  to  provide  a 
single  specimen  of  coincident  acinar  and  ductal 
adenocarcinoma  with  sufficient  material  to  prepare 
sequencing  libraries. 

I  have  worked  with  the  Human  Tissue 
Resources  Core  at  the  University  of  Chicago  to 
identify  additional  cases  of  coincident  ductal  and 
acinar  carcinoma  with  sufficient  material  to  prepare 
sequencing  libraries.  Not  all  the  specimens  were  of 
sufficient  purity  for  macrodissection,  and  so  a 
number  have  undergone  laser  microdissection  (Figure  3).  I  aimed  to  evaluate  at  least  5 
specimens.  I  have  isolated  acinar  and  ductal  tumor  DNA  from  6  specimens  in  addition  to 
the  one  from  the  PCBN,  with  an  additional  4  cases  available  in  case  I  again  run  into 
issues  of  sample  quality  as  I  did  for  Aim  la. 

This  work  addresses  the  following  subtasks  from  the  SOW:  Subtask  1.1,  subtask 
1.3.  I  had  planned  on  completing  sequencing  library  preparation  for  these  samples 
(included  in  subtask  1.3).  That  has  not  been  completed  yet  but  will  be  started  once  I  have 
obtained  normal  DNA  from  adjacent  tissue.  I  expect  obtaining  normal  tissue  (to  be 
completed  soon)  to  be  far  simpler  than  the  task  of  performing  laser  microdissection  to 
obtain  tumor  tissue  (already  completed).  A  minor  setback  was  that  I  was  expecting  to  be 
able  to  use  more  samples  from  the  PCBN,  but  not  as  many  were  available  as  I  had 
thought.  I  have  made  up  for  that  with  samples  from  the  University  of  Chicago. 


Figure  1.  Confirmation  of  high  confidence 
somatic  mutations  with  capillary 
sequencing  by  number  of  total  and  variant 
reads  (gray  =  not  confirmed;  black  = 
confirmed).  Confirmation  rate  =  0.85 
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At  the  time  the  proposal  for  this  project 
was  submitted,  next-generation  sequencing 
costs  were  decreasing  rapidly,  and  thus  I 
proposed  doing  whole  genome  sequencing  of 
these  samples.  However,  sequencing  costs 
have  since  plateaued  and  whole  genome 
sequencing  is  not  as  affordable  as  was 
anticipated.  Therefore,  instead  of  deep 
coverage  whole  genome  sequencing,  I  plan  to 
perform  deep  coverage  exome  sequencing  to 
identify  single  nucleotide  variants,  and  low 
coverage  whole  genome  sequencing  to  provide 
information  about  copy  number  changes  and 
identify  potential  structural  variants.  This 
strategy  has  been  employed  by  many  others, 
including  The  Cancer  Genome  Atlas  project, 
with  great  success  (REF###).  The  copy 
number  changes  and  structural  variants  will 
allow  evaluation  of  clonal  relationships  at  a 
genome-wide  level,  which  is  what  deep 
coverage  whole  genome  sequencing  was 
intended  to  do. 


Aim  2:  Characterize  the  molecular 
relationship  between  metastatic  disease  and 
primary  prostate  foci. 

Aim  2a:  Test  the  hypothesis  that  nodal 
metastases  are  subclones  of  high  grade 
prostate  cancer  foci  in  the  prostate. 

As  in  Aim  la,  I  have  completed 
sequencing  and  data  analysis  of  tumor  DNA 
from  lymph  node  metastasis  from  two  patients 
with  coincident  low  and  high  grade  foci  in  the 
prostate  (Table  1).  A  subset  of  the  high 
confidence  somatic  mutations  identified  by  VarScan  have  been  validated  by  Sanger 
sequencing  and  custom  capture 
and  resequencing  (Figure  1). 

The  results  of  this  sub- 
Aim  confirm  that  nodal 
metastases  are  closely  related  to 
high  grade  prostate  cancer,  and 
distantly  related  to  low  grade 
prostate  cancer  (Figure  2). 

Notably,  the  data  demonstrate  that 


Figure  2.  Venn  diagrams  (left)  depicting  the 
pattern  of  shared  and  private  high  confidence 
somatic  mutations  and  phylogenetic  trees  (right) 
depicting  the  relationship  of  coincident  foci  for 
PrCa  18  (A),  PrCa  14  (B),  PrCa  6  (C),  and  PrCa 
25  (D).  The  number  of  high  confidence  somatic 
mutations  is  labeled  within  each  Venn  diagram. 
In  C,  there  are  no  mutations  that  are  shared 
between  the  low  grade  focus  and  the  metastatic 
focus  and  not  also  shared  with  the  high  grade 
focus.  Gray  =  uninvolved  prostate;  blue  =  low 
grade  focus;  green  =  high  grade  focus;  purple  = 
metastatic  focus;  black  =  theoretical  common 
progenitor. 
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few  mutations  are  required  for  metastases  beyond  what  is  found  in  high  grade  cancer. 

The  data  are  also  consistent  with  the  hypothesis  that  nodal  metastases  are  subclones  of 
high  grade  prostate  foci,  for  mutations  identified  in  lymph  node  metastases  but  not 
initially  identified  in  high  grade  foci  can  be  identified  at  low  levels  in  high  grade  foci 
(Figure  4). 

Results  from  this  aim  are  now  published  and  the  manuscript  is  included  as  an 
appendix  to  this  report  (VanderWeele  et  al,  Cancer  Science,  2014).  This  work  fulfills  the 
following  sub  tasks  from  the  SOW:  sub  task  2.1,  sub  task  2.3,  sub  task  3.1. 


Figure  4.  Mutations  only  identified  in  metastasis  are  also  found  at  low  frequency  in  high  grade  foci. 
Mutations  identified  in  the  metastasis  are  shown  on  the  x-axis,  and  variant  allele  frequency  on  the  y- 
axis.  Mutations  marked  with  a  red  star  were  not  initially  identified  in  the  high  grade  focus. 
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Aim  2b:  Determine  the  molecular  relationship  of  CTCs  to  primary  prostate  cancer  foci. 

To  address  this  aim,  I  have  obtained  DNA  isolated  from  CTCs  from  7  patients, 
with  biological  replicates  from  three  of  those  patients.  However,  library  preparation  and 
sequencing  have  not  yet  begun.  There  are  two  tasks  related  to  this  aim:  evaluating  the 
feasibility  of  sequencing  CTCs,  which  involves  sequencing  biological  replicates  from 
two  subjects;  and  determining  the  relationship  of  CTCs  to  primary  prostate  cancer  foci. 
Given  the  financial  cost  of  conducting  these  experiments,  along  with  my  mentors  I 
decided  it  is  better  to  consolidate  these  efforts.  Furthermore,  in  addition  to  the 
relationship  between  CTCs  and  primary  cancer  foci,  I  am  also  interested  in  the 
relationship  between  CTCs  and  synchronous  metastases  to  determine  if  CTCs  represent 
current  metastatic  disease.  Therefore  I  have  switched  to  recruiting  exclusively  patients 
for  whom  primary  tissue  and  metastatic  tissue  are  available.  We  have  found  numerous 
patients  from  whom  we’ve  collected  CTCs,  who  do  not  have  primary  or  metastatic  tissue 
available,  and  a  publication  evaluating  protein  expression  in  those  patients’  CTCs  is 
accepted  for  publication.  However,  it  has  been  more  challenging  to  find  patients  with 
both  these  tissue  specimens  available. 

Nevertheless,  two  patients  have  been  recruited  with  primary  and  late  metastatic 
disease,  both  of  whom  underwent  CTC  collection  twice.  One  has  biological  replicate 
CTC  DNA  that  has  been  whole  genome  amplified.  CTC  DNA  from  the  2nd  patient  failed 
to  amplify  on  one  of  the  replicates,  and  he  is  scheduled  to  undergo  CTC  collection  again 
within  a  few  weeks.  To  avoid  batch  effects  I  have  delayed  sequencing  library 


8 


preparation  until  all  the  samples  from  both  patients  can  be  prepared  at  the  same  time.  I 
anticipate  that  will  begin  within  1-2  months.  In  collaboration  with  Dr.  Szmulewitz  I  have 
identified  7  additional  patients  with  metastatic  disease  and  primary  tissue  available  who 
will  be  recruited  for  CTC  collection  when  they  are  progressing  on  their  current  therapies, 
3  of  whom  are  scheduled  to  be  collected  within  the  next  month. 

This  work  addresses  subtask  1.2  and  subtask  1.4.  Work  on  subtask  2.2  is 
expected  to  occur  within  3  months.  Though  there  has  been  a  delay  in  performing  the 
sequencing,  I  believe  that  the  sequencing  data  we  obtain  will  be  more  valuable  because  I 
am  selecting  patients  with  more  informative  specimens  available. 

As  in  Aim  lb,  at  the  time  the  proposal  for  this  project  was  submitted,  next- 
generation  sequencing  costs  were  decreasing  rapidly,  and  thus  I  proposed  doing  whole 
genome  sequencing  of  these  samples.  Due  to  persistent  high  costs,  however,  instead  of 
deep  coverage  whole  genome  sequencing,  I  plan  to  perform  deep  coverage  exome 
sequencing  to  identify  single  nucleotide  variants,  and  low  coverage  whole  genome 
sequencing  to  provide  information  about  copy  number  changes  and  identify  potential 
structural  variants. 

Mentorship 

Dr.  Walter  Stadler  continues  to  be  my  mentor  for  clinical  and  translation  research 
and  has  continued  to  promote  my  professional  development.  I  meet  with  him  formally 
monthly  and  informally  several  times  throughout  the  week. 

Dr.  Yusuke  Nakamura,  Professor  of  Medicine,  is  my  new  primary  basic  science 
mentor.  He  is  one  of  the  pioneers  of  applying  the  study  of  genetic  variation  to  the 
medical  field.  He  joined  the  faculty  at  University  of  Chicago  in  2012  in  the  department 
of  Medicine  and  continues  his  research  in  cancer  genomics,  especially  that  of  prostate 
cancer  and  other  GU  cancers.  He  is  a  recent  recruit  to  the  University  of  Chicago  and  his 
interests  align  more  closely  with  my  interests  did  those  of  Dr.  Kevin  White.  In  addition, 
as  a  physician  and  member  of  the  section  of  hematology/oncology  he  is  more  focused  on 
translational  aspects  of  his  work.  I  meet  with  Dr.  Nakamura  weekly  on  an  informal  basis 
and  as  part  of  the  Nakamura  group  lab  meetings.  I  meet  with  him  more  formally 
monthly. 

Dr.  Funmi  Olopade  continues  as  a  mentor  as  an  expert  in  cancer  genetics  and 
genetics  of  hormone-regulated  cancers.  She  is  also  an  important  advisor  regarding  career 
development. 

Key  Research  Accomplishments 

•  Publication  of  the  first  manuscript  from  this  project,  “Low  grade  prostate  cancer 

diverges  early  from  high  grade  and  metastatic  disease.” 

Conclusion 

Like  other  cancers,  prostate  cancer  is  a  genetic  disease  that  progresses  through 
evolution  and  accumulation  of  genetic  changes.  Though  low  grade  and  high  grade 
prostate  cancer  can  evolve  from  a  common  progenitor,  they  are  distantly  related,  and  in 
some  cases  arise  independently.  In  contrast,  metastatic  disease  is  closely  related  to  high 


9 


grade  localized  disease.  Certain  genomic  alterations  likely  mark  high  grade,  high  risk 
disease,  and  these  includes  changes  in  the  TP53  pathway. 
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expression  in  circulating  tumor  cells  from  patients  with  metastatic  castrate 
resistant  prostate  cancer.  Journal  of  Translational  Medicine.  In  press. 

Inventions,  Patents  and  Licenses 

Nothing  to  report 

Reportable  Outcomes 

Nothing  to  report 

Other  Achievements 

The  data  from  this  work,  and  the  protected  time  afforded  by  this  award,  led  to  my 
successful  application  for  two  additional  grants  and  another  pending  application  which 
passed  initial  evaluation  and  is  now  under  review.  These  include  a  pilot  award  from  our 
institution’s  cancer  center,  a  Young  Investigator  Award  from  the  Cancer  Research 
Foundation,  and  an  award  from  Psi  Beta  Phi.  The  work  supported  by  this  award  has  led 
to  me  being  recruited  as  a  faculty  member  at  University  of  Chicago,  and  I  am  also  a 
member  of  the  University  of  Chicago  Comprehensive  Cancer  Center. 

I  have  successfully  transitioned  to  having  my  own  lab,  under  the  mentorship  of 
the  three  excellent  mentors  involved  with  this  award.  I  have  a  research  professional  and  a 
technician  in  the  lab,  last  year  I  mentored  a  urology  resident  working  in  the  lab,  and  this 
past  summer  I  mentored  an  undergraduate  student  who  was  part  of  the  Chicago 
Academic  Medicine  Program  at  the  Pritzker  School  of  Medicine. 

I  continue  to  be  interested  in  prostate  cancer  tumor  heterogeneity,  and  in  order  to 
obtain  higher  quality  clinical  specimens  I  have  established  a  tissue  bank  of  prostatectomy 
specimens  and  biopsy  specimens  fixed  in  the  non-crosslinking  agent  PAXgene.  I’ve  also 
collected  matched  blood,  plasma,  and  urine  samples. 

Opportunities  for  Training 
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As  per  the  SOW,  I  have  attended  the  following  courses: 

From  year  1 : 

Responsible  Conduct  of  Research 

Ecology  and  Evolution:  Principles  of  Population  Genetics  I 
Ecology  and  Evolution:  Genomic  Evolution 

From  year  2: 

Human  Genetics:  Introduction  to  Probability  and  Statistics  for  Geneticists 
From  year  3: 

I  am  currently  taking  Statistics:  Statistical  Methods  and  their  Applications 

The  Ecology  and  Evolution:  Fundamentals  of  Molecular  Evolution  course  I  had  planned 
for  year  1  is  no  longer  offered  and  there  is  no  equivalent  course  in  its  place. 

Opportunities  for  Professional  Development 

I  have  had  the  opportunity  to  attend  and  participate  in  many  conferences  during 
the  first  year  of  this  award.  I  presented  a  poster  on  my  work  looking  at  low  and  high 
grade  cancer  foci  at  the  Prostate  Cancer  Foundation  Scientific  Retreat.  This  meeting 
brings  together  the  brightest  minds  in  prostate  cancer  research  along  with  representatives 
from  industry  and  policy. 

I  presented  a  poster  on  our  initial  work  with  CTCs  at  the  GU  cancer  symposium, 
which  is  a  more  clinically  oriented  meeting  affiliated  with  the  American  Society  of 
Clinical  Oncology.  I  also  attended  the  ASCO  annual  meeting. 

I  am  the  institutional  PI  for  several  clinical  trials,  including  a  trial  evaluating 
neoadjuvant  chemotherapy  for  high  risk  prostate  cancer,  which  is  run  through  the 
CALGB/ Alliance  clinical  trials  consortium.  As  my  role  I  also  participate  in  the  semi¬ 
annual  CALGB/Alliance  meetings. 
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Understanding  the  developmental  relationship  between  indolent  and  aggressive 
tumors  is  central  to  understanding  disease  progression  and  making  treatment 
decisions.  For  example,  most  men  diagnosed  with  prostate  cancer  have  clinically 
indolent  disease  and  die  from  other  causes.  Overtreatment  of  prostate  cancer 
remains  a  concern.  Here  we  use  laser  microdissection  followed  by  exome 
sequencing  of  low-  and  high-grade  prostate  cancer  foci  from  four  subjects,  and 
metastatic  disease  from  two  of  those  subjects,  to  evaluate  the  molecular  relation¬ 
ship  of  coincident  cancer  foci.  Seventy  of  79  (87%)  high-confidence  somatic  muta¬ 
tions  in  low-grade  disease  were  private  to  low-grade  foci.  In  contrast,  high-grade 
foci  and  metastases  harbored  many  of  the  same  mutations.  In  cases  in  which 
there  was  a  metastatic  focus,  15  of  80  (19%)  high-confidence  somatic  mutations 
in  high-grade  foci  were  private.  Seven  of  the  80  (9%)  were  shared  with  low- 
grade  foci  and  65  (82%)  were  shared  with  metastatic  foci.  Notably,  mutations  in 
cancer-associated  genes  and  the  p53  signaling  pathway  were  found  exclusively 
in  high-grade  foci  and  metastases.  The  pattern  of  mutations  is  consistent  with 
early  divergence  between  low-  and  high-grade  foci  and  late  divergence  between 
high-grade  foci  and  metastases.  These  data  provide  insights  into  the  develop¬ 
ment  of  high-grade  and  metastatic  prostate  cancer. 
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Cancers  are  thought  to  progress  from  low  grade  to  higher 
grade/ 1}  leading  to  efforts  to  detect  and  eradicate  low- 
grade  disease  in  order  to  prevent  morbidity  and  mortality  from 
high-grade  disease.  The  grade  of  prostate  cancer  is  character¬ 
ized  by  the  Gleason  score  on  a  scale  of  2-10,  with  10  repre¬ 
senting  the  most  poorly  differentiated  tumors.  Although 
initially  described  as  intermediate  grade,  Gleason  6  cancers  are 
clinically  indolent  and  have  an  overwhelmingly  favorable 
prognosis. (2,3)  For  the  purposes  of  this  report,  Gleason  6  foci 
are  regarded  as  low  grade. 

Approximately  238  590  men  were  diagnosed  with  prostate 
cancer  in  the  US  in  2013,  the  majority  of  them  with  low-grade 
disease.  It  is  estimated  that  29  720  men  will  die  from  prostate 
cancer,  which  is  among  the  lowest  case/fatality  ratio  of  any 
cancer. (4)  Although  prostate  specific  antigen  (PSA)  screening  is 
able  to  identify  early  stage  prostate  cancer,  most  men  diag¬ 
nosed  with  prostate  cancer  die  from  competing  causes.  It  is 
therefore  unclear  whether  early  detection  associated  with  PSA 
screening  decreases  mortality. (5,6)  Current  clinical  treatment 
guidelines  support  active  surveillance  for  patients  with 
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low-grade  prostate  cancer  for  which  the  risk  of  tumor  progres¬ 
sion  is  low.(7)  Recently  some  guidelines  have  even  recom¬ 
mended  against  PSA  screening. (8)  Failure  to  follow  these 
guidelines  is  likely  due,  in  part,  to  concern  that  low-grade  foci 
progress  to  higher  grade  and  metastatic  disease. 

Several  studies  have  contributed  to  a  growing  catalog  of 
genetic  variation  in  prostate  cancer, (9_14)  giving  insight  into 
single  nucleotide  variations,  rearrangements  and  copy  number 
variation  prevalent  in  this  disease.  Aside  from  common  rear¬ 
rangements  involving  the  ETS  family  of  transcription  factors, 
especially  v-ets  avian  erythroblastosis  virus  E26  oncogene 
homolog  (ERG),  the  mutational  spectrum  of  prostate  cancers  is 
diverse. (10)  Although  increasing  attention  is  being  paid  to  the 
overall  molecular  heterogeneity  of  cancers  within  a  single 
patient,  there  is  limited  data  in  this  regard  specifically  for  pros¬ 
tate  cancer.  This  is  especially  relevant  for  prostate  cancer, 
which  is  often  multifocal. 

More  specifically,  it  is  not  clear  if  prostate  cancer  develops 
as  a  multifocal  disease  in  which  the  foci  are  clonally  related 
and  share  the  vast  majority  of  somatic  mutations,  or  if  it  is  a 
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multicentric  disease,  with  each  focus  having  its  own  set  of  pri¬ 
vate  somatic  mutations  and  each  representing  an  independent 
primary  lesion.  Recent  studies  have  examined  multifocal 
disease  and  found  evidence  of  a  monoclonal  origin  of  coinci¬ 
dent  foci,(15)  even  when  the  foci  have  different  Gleason  pat¬ 
terns/ 16,1 7)  This  conclusion  is  somewhat  surprising,  given  the 
difference  in  clinical  behavior  of  different  Gleason  patterns. 

The  present  study  targets  multifocal  disease,  specifically 
with  low-grade  (Gleason  6)  and  high-grade  (Gleason  8  or 
higher)  foci,  and  compares  the  molecular  relationship  of  these 
foci  with  each  other  and  with  synchronous  metastatic  disease. 
To  expand  the  number  of  specimens  available  for  the  present 
study,  prostatectomy  samples  that  had  been  formalin  fixed  and 
paraffin  embedded  (FFPE)  were  used,  with  modifications  made 
to  standard  protocols  to  increase  the  yield  of  material  used  for 
sequencing.  Exome  sequencing  was  used  to  achieve  a  compre¬ 
hensive  evaluation  of  all  genes  harboring  somatic  point  muta¬ 
tions.  The  findings  indicate  that  there  is  early  divergence  or 
complete  independence  of  low-grade  and  high-grade  disease, 
but  late  divergence  of  high-grade  disease  and  synchronous 
metastases. 

Materials  and  Methods 

Tissue  and  laser  capture  microdissection  (LMD).  Paraffin- 
embedded  prostate  and  lymph  node  tissue  that  was  1-5  years 
old  and  matched  FFPE  and  frozen  normal  prostate  tissue  was 
obtained  from  the  Human  Tissue  Resource  Center  (HTRC)  at 
the  University  of  Chicago  with  the  approval  of  the  Internal 
Review  Board.  Tissue  was  reviewed  by  a  genitourinary  pathol¬ 
ogist  (J.B.T.)  and  regions  of  interest  were  marked  for  further 
evaluation.  Four  specimens  had  sufficient  low-  and  high-grade 
disease  for  LMD.  Two  specimens  (PrCa  6  and  PrCa  18)  also 
had  sufficient  lymph  node  metastatic  disease.  Tumor  foci  and 
histologically  normal  prostate  were  microdissected  from  10- 
pm  sections  using  a  Leica  Laser  Microdissection  (Wetzlar, 
Germany)  system. 

Exome  library  preparation.  Genomic  FFPE  DNA  was  isolated 
using  a  QIAamp  DNA  FFPE  Tissue  Kit  (Qiagen,  Valencia, 
CA,  USA)  following  the  manufacturer’s  instructions,  but 
increasing  the  incubation  in  Proteinase  K  to  overnight  and 
reducing  the  90°C  incubation  to  30  min.  DNA  from  frozen  tis¬ 
sue  was  isolated  using  a  QIAamp  Mini  Kit  (Qiagen).  DNA 


Table  1.  Subject  characteristics 


Case 

Patient  age  (years) 

Race 

Stage 

Gleason  -  glandf 

Library 

Gleason  -  focusj 

PrCa  18 

65 

Caucasian 

pT3bN1  Mx 

4  +  3 

Normal 

NA 

Low  grade 

6 

High  grade 

8 

Metastasis 

NA 

PrCa  14 

54 

African- 

pT3bN1  Mx 

4  +  3 

Normal 

NA 

American 

Low  grade 

6 

High  grade 

8 

PrCa  6 

70 

Caucasian 

pT3bN1  Mx 

4  +  3  (5) 

Normal 

NA 

Low  grade 

6 

High  grade 

8 

Metastasis 

NA 

PrCa  25 

59 

Asian 

pT2cN0Mx 

3  +  4 

Normal 

NA 

Low  grade  6 

High  grade  9 


tGleason  score  assigned  on  pathological  review  of  the  prostatectomy  specimen.  JGleason  score  of  the  focus  isolated  by  microdissection.  PrCa  6 
had  a  Gleason  score  of  4  +  3  with  a  minor  component  of  5.  PrCa  14  had  metastatic  disease  in  a  lymph  node,  which  was  not  of  sufficient  mass  to 
be  isolated  for  sequencing.  NA,  not  applicable. 


was  fragmented  using  a  Covaris  S2  sonicator  (Woburn,  MA, 
USA),  with  settings  of  10%  duty  cycle,  intensity  4,  200  cycles 
per  burst  and  120  s.  The  fragmented  DNA  was  blunt-end 
repaired  using  T4  Polymerase,  Klenow  and  T4  PNK,  and  ade- 
nylated.  Fragments  were  ligated  to  adaptors  ISl_adapter_P5.F 
and  IS3_adapter_P5  +  P7.R(18)  and  an  index  tag  added.  The 
product  was  amplified  and  cleaned  up  using  Agencourt  AM- 
Pure  XP  beads  (Beckman  Coulter,  Indianapolis,  IN,  USA). 

Libraries  were  run  on  a  BioAnalyzer  (Agilent,  Santa  Clara, 
CA,  USA)  to  confirm  fragment  size.  The  libraries  from  the 
metastases  did  not  undergo  further  size  selection.  The  remain¬ 
ing  libraries  underwent  selection  using  an  E-gel  (Invitrogen, 
Grand  Island,  NY,  USA)  followed  by  PCR  amplification. 

Enrichment  for  exomes  was  performed  using  NimbleGen 
SeqCap  EZExome  v2.0  (Roche,  Madison,  WI,  USA)  following 
the  manufacturer’s  instructions.  The  exome-enriched  library 
DNA  was  amplified  and  run  on  a  BioAnalyzer  to  confirm  frag¬ 
ment  size.  The  resulting  libraries  were  run  on  a  HiSeq  2000 
(Illumina,  San  Diego,  CA,  USA)  in  a  highly  multiplexed 
1  x  50  run  and  a  subsequent  2  x  100  run. 

Sequencing  analysis  and  identification  of  somatic  mutations. 
Overlapping  or  adapter-containing  reads  were  clipped  using 
SeqPrep-b83fd00.  Reads  were  quality  trimmed  and  aligned  to 
hgl8  using  bwa  0.5.9(19)  and  merged  with  duplicates  removed 
using  Samtools  0.1.18.(2O)  Genome  Analysis  Toolkit(21)  was 
used  for  local  realignment  and  quality  score  recalibration. 
Somatic  mutations  were  identified  using  VarScan  v2.2.8.(22) 
High-confidence  somatic  mutations  were  defined  as  variants 
with  coverage  of  at  least  10  x,  no  supporting  reads  in  normal, 
at  least  four  supporting  reads  in  the  tumor  and  with  supporting 
reads  on  both  strands.  Germline  single-nucleotide  polymor¬ 
phisms  (SNPs)  identified  in  dbSNP135,  1000  genomes  and  ex¬ 
omes  of  normal  controls  published  by  the  Max  Planck  Institute 
(MPI)(23-25)  were  removed.  SAMtools  was  used  to  determine 
the  number  of  total  and  variant  reads  in  matched  libraries  for 
positions  mutated  in  low-grade  foci.  Sample  ethnicity  was 
determined  by  principle  component  analysis  using  common, 
germline,  coding  variants  shared  with  samples  from  the  1000 
genomes  project. (24) 

Custom  capture  and  identification  of  previously  identified 
mutations.  DNA  from  seven  additional  foci  from  PrCa  6 
were  isolated  using  LMD  (Leica  Microsystems  LMD6500). 
Custom  capture  probes  targeting  high-confidence  mutations 
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identified  through  exome  sequencing  were  obtained  from 
Roche.  Sequencing  libraries  from  the  seven  loci  were  pre¬ 
pared  using  the  High-Throughput  Genome  Analysis  Core  at 
the  University  of  Chicago,  pooled,  captured  following  the 
manufacturer’s  instructions,  and  sequenced  on  one  lane  of  a 
HiSeq2000  (Illumina).  Reads  were  analyzed  with  SeqPrep- 
b83fdOO,  aligned  to  hgl8  using  bwa,  and  merged  with 
duplicates  removed  using  Samtools  0.1.18.  Samtools  was 
used  to  determine  the  number  of  total  and  variant  reads  in 
each  library  for  each  variant  position.  Variants  were  consid¬ 
ered  present  if  >10%  of  reads  supported  the  variant. 

Confirmation  of  somatic  mutations  and  immunohistochemisty 
(IHC).  Mutations  were  visualized  using  IGV  2.0.(26)  Genomic 
DNA  from  each  focus  underwent  amplification  using  Genome  - 
Plex  WGA2  (Sigma,  St.  Louis,  MO,  USA)  and  primers  target¬ 
ing  a  subset  of  variants  were  designed  using  Primer3Plus  and 
obtained  from  IDT  (Coral  ville,  I  A,  USA).  The  PCR  products 
were  sequenced  in  the  University  of  Chicago  DNA  Sequencing 
and  Genotyping  Facility.  ERG  IHC  was  performed  at  the 


HTRC  using  anti-ERG  antibody  EPR3864  (Abeam, 
Cambridge,  MA,  USA). 

Results 

To  evaluate  the  molecular  relationship  between  low-  and  high- 
grade  prostate  cancer,  we  identified  multifocal  formalin-fixed, 
paraffin-embedded  specimens  with  both  low-grade  and  high- 
grade  disease  with  sufficient  mass  for  laser  capture  microdis¬ 
section.  The  subjects  from  whom  the  specimens  came  were 
representative  of  patients  with  high-risk  localized  prostate  can¬ 
cer  (Table  1). 

Exome  sequencing  was  performed  on  matched  cancer  foci 
and  histologically  normal  prostate  glands  from  four  specimens 
(Fig.  1).  To  improve  the  yield  of  DNA  for  downstream 
sequencing,  proteinase  K  digestion  was  increased  to  18  h,  90° 
incubation  was  reduced  to  30  min  and  size  selection  by  gel 
extraction  was  replaced  with  size  selection  with  an  E-gel  or 
eliminated  altogether.  On  average,  88%  of  RefSeq  coding 
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Fig.  1.  Histology  of  coincident  prostate  cancer  foci.  Representative  H&E  stains  and  illustrations  representing  the  prostate  cancer  foci  that  were 
laser  microdissected.  Two  cancer  foci  and  uninvolved  prostate  glands  were  isolated  from  PrCa  18  (a),  PrCa  14  (b),  PrCa  6  (c)  and  PrCa  25  (d).  In 
addition,  a  metastatic  (Met)  focus  was  isolated  from  PrCa  18  and  PrCa  6.  For  PrCa  14  (b),  the  foci  were  from  different  levels  of  the  prostate.  For 
the  other  three  specimens  the  foci  were  at  the  same  level.  Light  gray,  histologically  normal  prostate;  dark  gray,  low-grade  cancer  focus;  striped, 
high-grade  cancer  focus;  checked,  metastatic  focus  from  a  lymph  node  removed  at  the  time  of  prostatectomy. 
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Fig.  2.  Mutation  characteristics,  (a)  Number  of 
high-confidence  somatic  mutations  across  all  foci. 
Non-silent,  non-silent  mutations;  Unique,  number 
of  unique  genes  harboring  a  non-silent  mutation; 
Reported,  gene  reported  to  be  mutated  in 
references  9-12  and  14.  (b)  Spectrum  of  unique 
high  confidence  somatic  mutations  across  all  foci, 
(c)  Number  of  high-confidence  somatic  mutation 
types  in  all  prostate  samples.  L,  low-grade  focus;  H, 
high-grade  focus;  M,  metastatic  focus;  NS, 
nonsynonymous;  SYN,  synonymous;  UTR3,  3' 
untranslated  region;  UTR5,  5'  untranslated  region; 
SJ,  splice  junction;  STOP,  premature  stop  codon,  (d) 
Confirmation  of  high-confidence  somatic  mutations 
with  capillary  sequencing  by  number  of  total  and 
variant  reads  (gray,  not  confirmed;  black, 
confirmed).  Confirmation  rate,  0.85. 


bases  were  covered  at  10  x  or  greater.  A  mean  of  30.1  somatic 
coding  mutations  was  identified  per  cancer  focus  (Supporting 
Information  Table  SI). 

To  evaluate  potential  bias  introduced  by  formalin  fixation, 
matched  FFPE  and  frozen  (Optimal  Cutting  Temperature 
compound  [OCT] -embedded)  samples  of  normal  prostate  were 
subjected  to  exome  sequencing.  The  number  of  false  positive 
variant  reads  was  similar  (Fig.  SI  A),  as  were  the  types  of 
mutations  (Fig.  SIB),  suggesting  that  there  was  little  bias 
introduced  from  formalin  fixation. 

Three  hundred  and  one  somatic  mutations  were  identified 
across  10  foci  (Table  S2).  Of  these,  71  were  synonymous  and 
unlikely  to  alter  gene  function  (Fig.  2a).  Of  174  unique  genes 
harboring  somatic  mutations  that  were  likely  to  be  functional,  73 
were  not  reported  in  five  previous  reports  of  mutations 
in  localized  and  advanced  disease/9-12,1  }  The  majority  of 
variants  identified  were  transitions  leading  to  nonsynonymous 
mutations  (Fig.  2b, c),  as  expected.  A  subset  of  20  high- 
confidence  somatic  mutations  was  confirmed  in  tumor  and  unin¬ 
volved  tissue  with  capillary  sequencing  (Fig.  2d).  Those  that 
were  not  confirmed  by  capillary  sequencing  had  low  variant 
allele  frequency  and  were  approximately  the  level  of  detection 
for  capillary  sequencing  (-20%).  Nevertheless,  the  mutation 
(0.82/Mb)  and  validation  rate  (85%)  using  fixed  prostate 
specimens  in  the  present  study  are  similar  to  previous  reports 
(0.9/Mb  and  91%,  respectively)  using  frozen  tissue.(9)  The  high 
fidelity  of  this  sequencing  supports  the  use  of  formalin-fixed  tis¬ 
sue  for  next-generation  sequencing  studies. 

The  vast  majority  of  somatic  mutations  in  low-  and  high-grade 
foci  were  private  to  the  focus  in  which  they  were  identified.  Nine 
of  79  (1 1%)  high-confidence  somatic  mutations  in  the  low-grade 
foci  were  shared  with  a  high-grade  focus,  and  nine  of  139  (7%) 
high-confidence  somatic  mutations  in  the  high-grade  foci  were 
shared  with  a  low-grade  focus  (Fig.  3).  Capillary  sequencing 
confirmed  that  these  nine  shared  somatic  mutations  were  not 
germline  variants.  In  two  subjects  (14  and  18)  all  of  the  muta¬ 
tions  were  private.  In  the  other  two  subjects  (6  and  25)  a  small 
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minority  of  mutations  was  shared.  In  contrast  to  the  low-grade 
foci,  67  of  80  (84%)  high-confidence  somatic  mutations  identi¬ 
fied  in  high-grade  foci  were  shared  with  metastatic  disease  and 
metastatic  foci  had  few  mutations  beyond  those  identified  in 
high-grade  disease  (Fig.  3a,c). 

To  evaluate  more  broadly  the  clonal  relationship  of  multifo¬ 
cal  disease,  custom  capture  baits  targeting  variants  identified 
through  exome  sequencing  of  PrCa  6  were  used  to  capture  the 
DNA  from  seven  additional  cancer  foci  from  that  subject 
(Table  S3).  The  average  coverage  depth  of  these  variants  was 
208  and  78  previously  identified  variants  were  covered  at 
>70  x  in  each  focus.  Three  high-grade  foci  harbored  a  median 
of  56  previously  identified  variants,  three  intermediate-grade 
foci  harbored  a  median  of  15  variants  and  one  low-grade  focus 
harbored  three  variants.  As  in  the  whole  exome  data  a  small 
minority  of  mutations  was  shared,  with  just  two  of  78  (2.6%) 
variants  found  to  be  ubiquitous  among  the  seven  cancer  foci. 
Fourteen  of  78  (17.9%)  were  not  identified  in  any  of  the  seven 
additional  foci  (Fig.  S2).  Eight  of  these  14  were  among  the  12 
variants  originally  identified  as  private  to  the  low-grade  focus. 

Rearrangements  involving  ERG  are  the  most  common  genetic 
derangement  in  prostate  cancer  and  are  thought  to  occur  early  in 
tumorigenesis.  These  events  are  not  typically  captured  by  exome 
sequencing.  Therefore  we  performed  IHC  for  ERG  to  look  for 
overexpression,  which  is  typically  restricted  to  cells  harboring 
an  ERG  rearrangement.  Of  the  two  specimens  exhibiting  posi¬ 
tive  ERG  staining,  PrCa  6  showed  consistently  positive  staining 
in  all  tumor  foci  (Fig.  4).  In  contrast,  in  PrCa  18  the  low-grade 
focus  was  negative,  the  high-grade  focus  was  equivocal  with 
few  cells  positive  for  ERG  and  the  metastatic  focus  was  more 
widely  focally  positive. 

Together  with  the  immunohistochemical  analysis,  the  pat¬ 
terns  of  shared  and  private  mutations  indicate  early  divergence 
of  low-  and  high-grade  foci.  In  two  cases  no  shared  mutations 
were  identified  and  independent  origin  of  low-  and  high-grade 
disease  cannot  be  ruled  out.  In  contrast,  there  is  late  diver¬ 
gence  of  high-grade  foci  and  metastatic  disease.  Consistent 
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Fig.  3.  The  molecular  relationship  of  coincident  foci.  Venn  diagrams 
(left)  depicting  the  pattern  of  shared  and  private  high-confidence 
somatic  mutations  and  phylogenetic  trees  (right)  depicting  the  rela¬ 
tionship  of  coincident  foci  for  PrCa  18  (a),  PrCa  14  (b),  PrCa  6  (c)  and 
PrCa  25  (d).  The  number  of  high-confidence  somatic  mutations  is 
labeled  within  each  Venn  diagram.  In  (c),  there  are  no  mutations  that 
are  shared  between  the  low-grade  focus  and  the  metastatic  focus  and 
also  not  shared  with  the  high-grade  focus.  Within  each  phylogenetic 
tree,  branch  length  is  proportional  to  the  number  of  mutations.  Gray, 
uninvolved  prostate;  blue,  low-grade  focus;  green,  high-grade  focus; 
purple,  metastatic  focus;  black,  theoretical  common  progenitor. 


with  this,  there  were  no  somatic  mutations  that  were  shared 
between  low  grade  and  metastatic  disease  and  also  not  shared 
with  high-grade  disease. 

Prostate  tumor  foci  consist  of  an  admixture  of  tumor  cells  and 
stromal  cells.  Despite  microdissection  of  the  tumor  foci,  the  var¬ 


iant  allele  frequencies  in  the  low-grade  foci  were  modestly  lower 
than  the  high-grade  lesions  (Fig.  S3),  suggesting  higher  contami¬ 
nation  with  non-tumor  cells.  The  contaminating  stromal  cells 
may  interfere  with  the  ability  to  call  somatic  mutations,  espe¬ 
cially  in  the  low-grade  foci,  leading  to  a  falsely  elevated  number 
of  apparently  private  mutations  and  obscuring  the  clonal  rela¬ 
tionship  among  concurrent  foci.  Therefore  for  each  position 
identified  as  a  private  high-confidence  somatic  mutation  in  the 
low-grade  foci,  the  number  of  sequencing  reads  supporting  that 
mutation  in  the  matched  high  grade  and  metastatic  foci  was 
determined,  relaxing  filters  based  on  DNA  strand  and  quality 
scores  (Fig.  S4).  Of  the  23  high-confidence  somatic  mutations  in 
PrCa  18  and  PrCa  6  identified  as  private  mutations,  none  had 
two  or  more  variant  reads  in  high  grade  or  metastatic  foci.  Thus, 
even  with  a  less  conservative  definition  of  mutation  (two  variant 
reads),  high-grade  foci  and  metastases  harbored  few  variants 
seen  in  low-grade  foci  and  the  majority  of  variants  in  the  low- 
grade  foci  was  private.  This  was  corroborated  with  evidence 
from  capillary  sequencing.  Three  mutations  private  to  low-grade 
foci  and  two  shared  with  other  foci  were  among  the  subset  of 
mutations  confirmed  by  capillary  sequencing. 

Two  genes  exhibited  recurrent  nonsynonymous  mutations  or 
mutations  in  the  untranslated  region  (Table  S4).  Low  density 
lipoprotein  (LDL)  receptor-related  protein  IB,  which  encodes 
an  LDL-family  receptor,  is  frequently  deleted  in  multiple 
malignancies. (27)  However,  recent  work  suggests  its  high  fre¬ 
quency  of  mutation  may  not  imply  a  significant  role  in  can¬ 
cer/28^  ZNF717  is  one  of  a  number  of  zinc  finger  proteins  and 
its  role  in  malignancy  is  unclear. 

Comparison  of  genes  with  nonsynonymous  mutations  in  our 
data  to  the  Catalogue  Of  Somatic  Mutations  In  Cancer  (COS¬ 
MIC)  cancer  gene  census  revealed  three  genes  in  common 
(Table  2).  These  genes  include  TP53,  the  most  frequently 
mutated  gene  in  advanced  cancers  including  prostate  cancer/1  } 
as  well  as  ATM  and  SS18L1.  All  of  these  three  genes  were 
mutated  in  high  grade  or  metastatic  disease,  but  not  in  low-grade 
disease.  Gene  ontology  analysis(29)  revealed  enrichment  for 
genes  in  the  p53  signaling  pathway  among  high-grade  foci 
(adjusted  P-value  0.037),  including  TP53  and  ATM.  Cell  cycle 
genes  were  also  enriched  (adjusted  P-value  0.054).  No  other 
pathways  were  statistically  significantly  enriched. 


Discussion 

Given  the  disparate  outcomes  between  men  with  low-grade  pros¬ 
tate  cancer  and  those  with  high-grade  disease,  a  fundamental 
question  in  prostate  cancer  management  is  the  possible  molecu¬ 
lar  relationship  among  grades  and  metastatic  disease.  Multiple 
lines  of  evidence  suggest  multifocal  disease  is  often  indepen¬ 
dent/30,3^  However,  recent  reports  have  demonstrated  shared 
molecular  derangements  of  coincident  low-  and  high-grade 
foci.(16,17)  The  present  study  identifies  mutations  shared  between 
low-  and  high-grade  cancer  foci,  supporting  the  ability  of  a  sin¬ 
gle  progenitor  to  give  rise  to  both  low-  and  high-grade  disease. 
However,  the  relationship  between  these  foci  was  distant  relative 
to  the  relationship  between  high-grade  disease  and  synchronous 
metastases.  In  fact,  in  two  of  the  four  cases  it  cannot  be  ruled  out 
that  the  foci  arose  from  independent  origins.  The  early  diver¬ 
gence  of  low-  and  high-grade  disease  may  help  explain  distinct 
differences  in  their  clinical  behavior.  Indeed,  we  found  no  evi¬ 
dence  of  direct  progression  from  low  grade  to  metastatic  disease. 
Instead,  there  was  an  overwhelming  fraction  of  shared  mutations 
between  high  grade  and  metastatic  disease,  which  is  consistent 
with  late  divergence  of  these  high-grade  foci  (Fig.  5). 
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Fig.  4.  ERG  expression.  Immunohistochemical 
analysis  for  ERG  in  coincident  foci  from  PrCa  6  and 
PrCa  18. 


Table  2.  Overlap  of  mutated  genes  with  the  COSMIC  cancer  gene  census 


Subject 

Histology 

Chromosome 

Hg18  position 

Reference 

Variant 

Symbol 

GERP 

Type 

aa  Change 

6 

High 

11 

107675775 

G 

A 

ATM 

5.22 

STOP 

W1710* 

6 

LN 

11 

107675775 

G 

A 

ATM 

5.22 

STOP 

W1710* 

6 

High 

20 

60169915 

C 

G 

SS18L1 

3.98 

NS 

P87R 

6 

LN 

20 

60169915 

C 

G 

SS18L1 

3.98 

NS 

P87R 

14 

High 

17 

7517833 

C 

A 

TP53 

4.78 

NS 

C145F 

Three  genes  harboring  a  nonsynonymous  mutation  or  a  premature  stop  codon  are  also  found  in  the  COSMIC  cancer  gene  census.  All  three  were 
found  in  either  high-grade  foci  or  metastatic  disease,  but  not  in  low-grade  foci.  GERP,  genomic  evolutionary  rate  profiling  score;  LN,  lymph  node 
metastasis;  NS,  nonsynonymous;  STOP,  introduction  of  premature  stop  codon;  *,  stop  codon. 


For  the  cases  in  which  there  was  evidence  of  a  common 
progenitor  for  both  low-  and  high-grade  disease  (PrCa  25  and 
PrCa  6),  the  nature  of  the  common  progenitor  is  not  clear. 
Histological  low-grade  cancer,  HGPIN  or  even  histologically 
uninvolved  prostate  are  all  possible  progenitors.  If  the  progenitor 
is  not  histological  low-grade  cancer,  high-grade  and  low-grade 
disease  may  have  emerged  simultaneously.  Characterization  of 
the  common  progenitor  requires  further  study. 

The  present  study  demonstrates  there  is  late  divergence  of 
high-grade  cancer  and  synchronous  metastatic  disease,  confirm¬ 
ing  high-grade  disease  as  the  precursor  to  metastasis  and  sug¬ 
gesting  few  if  any  additional  mutations  are  required  for 
malignant  cells  to  gain  the  ability  to  metastasize.  Given  the  dif¬ 
ficulty  in  obtaining  tissue  from  typical  bone  metastases  in  pros¬ 
tate  cancer,  more  easily  accessible  prostate  lesions  might  offer 
a  substitute  for  identifying  genetic  derangements  leading  to  pri¬ 
mary  resistance  of  metastatic  disease.  However,  mechanisms  of 
secondary  resistance  are  likely  induced  or  enriched  by  the 
selective  pressure  of  therapy  and  thus  are  likely  found  at  low 
levels,  if  at  all,  in  the  primary  lesion. 

The  most  common  gene  fusion  event  in  prostate  cancer  is  the 
TMPRSS2-ERG  fusion,  leading  to  high  levels  of  expression  of 
ERG,  which  is  otherwise  expressed  at  very  low  levels  in  prostate 
cancer.  We  evaluated  for  the  presence  of  this  fusion  through 
IHC  for  ERG  and  those  findings  supported  the  evidence  from 
the  exome  sequencing  data.  A  limitation  of  exome  sequencing  is 
the  inability  to  identify  additional  chromosomal  rearrangements. 
Although  this  is  a  limitation  it  was  not  required  for  the  goal  of 
the  present  study,  which  was  to  identify  tumor  lineages. 

One  goal  of  sequencing  prostate  tumors  is  to  identify  bio¬ 
markers  that  will  supplement  the  prognosis  given  by  clinical 
features  alone.  To  that  end  we  sought  to  identify  mutated 
genes  or  pathways  that  were  enriched  in  high  grade  and  meta¬ 
static  disease.  TP53  is  the  most  frequently  mutated  gene  in 
advanced  prostate  cancer  and  we  found  the  p53  signaling 
pathway  to  harbor  mutations  only  in  high  grade  and  metastatic 
disease.  Thus,  members  of  the  p53  signaling  pathway  are 
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intriguing  candidates  for  potential  biomarkers  of  aggressive 
disease.  Recently,  expression  of  a  panel  of  cell  cycle  progres¬ 
sion-associated  genes  has  been  correlated  with  outcomes  of 
those  with  low-  or  intermediate-grade  prostate  cancer.(32)  We 
also  found  enrichment  for  cell  cycle-related  genes  among  those 
mutated  in  high-grade  disease,  although  this  was  of  borderline 
statistical  significance  in  this  small  cohort. 

Given  the  good  outcomes  of  low-grade  prostate  cancer,  we 
hypothesized  that  mutations  not  found  in  low-grade  disease  are 
more  likely  to  be  drivers  of  progression  and  metastasis  and 
therefore  more  important  potential  therapeutic  targets.  Indeed, 
the  three  genes  identified  here  that  are  also  found  in  the  COS¬ 
MIC  cancer  gene  census  were  identified  exclusively  in  high 
grade  and  metastatic  foci.  Of  note,  all  three  mutations  are  in 
highly  conserved  regions  as  evidenced  by  the  high  genomic 
evolutionary  rate  profiling  score  and  thus  are  likely  to  be 
biologically  significant.  Identification  of  driver  genes  through 
this  strategy  has  the  potential  to  guide  the  development  of 
future  therapies. 

The  present  study  contributes  to  the  growing  body  of  work 
demonstrating  that  next-generation  sequencing  from  microdis- 
sected  FFPE  prostatectomy  specimens  is  feasible.  Based  on 
these  techniques,  we  found  that  coincident  low-  and  high-grade 
prostate  cancer  can  emerge  through  either  independent 
progression  or  early  divergence  from  a  common  progenitor. 


Fig.  5.  Model  of  prostate  cancer  evolution.  Low-grade  and  high- 
grade  cancer  foci  progress  largely  in  parallel,  diverging  early  from  a 
common  progenitor.  Metastatic  disease  exhibits  late  divergence  from 
high-grade  disease.  It  cannot  be  ruled  out  that  in  some  cases  low-  and 
high-grade  diseases  arise  through  independent  origins. 
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High-grade  disease  is  characterized  by  mutations  in  known 
cancer-associated  genes  and  the  p53  signaling  pathway.  Meta¬ 
static  disease  is  closely  related  to  high-grade  primary  foci. 
These  data  can  be  leveraged  to  help  identify  potential  biomar¬ 
kers  and  drivers  of  progression  of  clinically  significant  disease. 
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